Enterprise Database Systems
Apache Storm Introduction
Apache Storm Introduction - API and Topology
Apache Storm Introduction – Architecture and Installation

Apache Storm Introduction - API and Topology

Course Number:
df_apsi_a02_it_enus
Lesson Objectives

Apache Storm Introduction - API and Topology

  • start the course
  • use Trident for a simple topology
  • describe topology state management with Trident
  • describe the different types of Trident spouts available for implementing fault-tolerant Trident state management
  • describe the different Trident State APIs available for implementing fault-tolerant Trident state management
  • describe distributed RPC model and how it is used with Apache Storm
  • describe DRPC modes of operation and topology types
  • deploy a Trident topology to a Storm cluster
  • describe the Storm UI home page
  • launch a Storm topology to a local cluster and view cluster activity in the Storm UI
  • analyze a Storm topology using the Storm UI
  • describe the process of using the Nimbus Thrift client for obtaining Storm cluster metrics
  • set up a Maven project in Eclipse IDE that can be used to write Java client code for connecting to a Nimbus Thrift server
  • write Java client code that connects to a Nimbus Thrift server and retrieves Storm cluster statistics
  • describe the general architecture of Apache Kafka
  • describe Kafka components and data model
  • produce and consume a Kafka topic
  • consume Kafka messages in a Storm topology
  • describe some options for using Storm's Core APIs to implement micro-batching in a Storm Core topology
  • describe Apache Hadoop's use with Storm
  • download and install Apache Hadoop on a development machine
  • describe how Apache Storm applications can be run on Hadoop YARN clusters to leverage YARN resource management
  • describe the Puppet architecture and some key framework components
  • describe how JMX and Ganglia can be integrated and used to monitor Storm clusters
  • describe how HBase and Redis can be integrated and used as datastores with Apache Storm
  • integrate and use JMX in Storm to obtain Storm Nimbus and Supervisor metrics
  • demonstrate increased knowledge of configuring and installing Apache Storm

Overview/Description
Storm makes it easy to stream massive amounts of unbounded streams of data while providing a fault-tolerant system. This system accompanied with other integrations can take any business to the next level. In this course, you will be introduced to Trident and its relationship to Storm, along with its integration. Many other integrations will be explored including Hadoop, Kafka, JMX, Ganglia, and even automation using Puppet as well as monitoring and analytics tools. In this course, you will also learn how to deploy the Storm architecture.

Target Audience
Individuals interested in how to perform an installation and configure it, as well as design and create basic Storm topologies

Apache Storm Introduction – Architecture and Installation

Course Number:
df_apsi_a01_it_enus
Lesson Objectives

Apache Storm Introduction – Architecture and Installation

  • start the course
  • describe in a higher scope, Apache Storm and its characteristics
  • describe why Apache Storm is used
  • describe the Apache Storm Architecture
  • identify a tuple and a bolt and their use in Storm
  • identify a spout and its use in Storm
  • identify streams and their use in Storm
  • describe the different operation modes of Storm
  • identify Storm components and their functionality in the source code for an example Storm application
  • describe the setup process for an Integrated Storm development environment
  • use Maven to compile and run a Storm application
  • describe the installation and setup process for ZooKeeper as a standalone server
  • install and set up ZooKeeper on a development machine
  • deploy a ZooKeeper server in standalone mode and test it with a ZooKeeper client connection
  • describe the process for setting up and deploying a ZooKeeper cluster
  • demonstrate the process of setting up a production Storm cluster
  • describe the process of configuring the parallelism of a topology
  • configure the parallelism of spout and bolt components in a Storm topology
  • describe briefly stream groupings and their types
  • use stream groupings in a Storm topology
  • describe the Guaranteed Messaging Process
  • describe the fault-tolerant characteristics of Storm
  • describe briefly what Trident is and how it's used
  • describe Trident's data model and its use
  • describe several operations of Trident
  • test your knowledge of Apache Storm and the components of the system

Overview/Description
Apache Storm is a fast and scalable open source distribution system that drives real-time computations. Storm makes it easy to reliably process unbounded streams of data, doing for real-time processing what Hadoop did for batch processing. Storm is simple, can be used with any programming language, and is a lot of fun. This course will introduce Storm providing an in depth approach to the concepts and how Apache Storm ticks. Apache Storm can integrate with many other products and this course will introduce you to ZooKeeper along with the Storm Trident Architecture and how they are used with Storm model.

Target Audience
Individuals interested in learning the architecture of Apache Storm and how it can be used.

Close Chat Live